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CROSS BAR MIILTTPATH RESOURCE CONTROLLER SYSTEM AND METHOD 
Background of the Invention 

This invention relates generally to a system and method for managing access to common 
resources in a multiprocessor system wherein it is desirable to provide the fastest access by the 

processors to the common resources. 

When a particular application/project/job requires more processing power than a single 
processor is capable of providing, it becomes necessary to provide a coprocessor, such as a 
digital signal processor (DSP) or a floating point unit (FPU). Thus, the tasks associated with the 
particular application are handled in unison by the main processor and the coprocessor. The 
most common conventional solution to solving the problem of how to allocate the resources to 
the multiple processors is to utilize a dual-ported memory subsystem wherein each processor has 
equal access to the common resources that may be used by both processors. Alternatively, each 
processor may be provided with a dedicated resource and a mechanism for transferring 
commands and data through a shared "Mail Box". The shared "Mail Box" typically includes a 
number of first in first out (FIFO) registers of various lengths. 

The conventional dual-ported memory solution provides processor independent design 
implementation, but requires a large amount of hardware for the random access arbitration for 
both processors. Consequently, the actual implementation of the arbitration logic and the 
random access for the common bus creates more delay on the common resources since the access 
to the common bus must be determined prior to accessing the common resources. The typically 
small degradation in the access speed in the dual-ported memory gets magnified by a significant 
amount when that common resource is the main memory because the main memory is the 
common resource most utilized by both processors. Therefore, the interdependency of the 
multiple processors increases since they both rely heavily on the main memory. 

The conventional dedicated resource for each processor with the shared "Mail Box" 
scheme prevents the multiple processors from competing with each other for the same resource, 
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but suffers greatly in terms of access speed since the data and commands must all pass through 
the "Mail Box" which has a relatively narrow throughput. In addition, duplicative resources are 
necessary since each processor has its own dedicated resources. Although the scheme works 
quite well when the tasks for the processors are well defined and common data transfer is 
5 relatively small, the actual performance and resource utilization suffers greatly when the tasks 
are not well defined and the processors are therefore more interdependent. 

In another conventional system for sharing common memory between one or more 
processors, the memory has gate logic associated with each processor and each bit in the memory 
has two access points (e.g., one access point for the first processor and one access point for the 
10 second processor for a two processor system). In this configuration, the contention between the 
D processor is reduced since each processor has its own access to each bit of memory. The 



fifi drawback with this approach is that there is large amount of duplicated logic, including the read 

y and write logic, to control the accesses by each processor. In addition, each memory cell is more 

f F# ** 
P + '» 

^? complex than a typical memory cell. In particular, a typical DRAM memory cell has a single 

H transistor and a capacitor whereas the memory cell for the dual access port memory has four 



kJ Thus, it is desirable to provide a cross bar multipath resource controller that overcomes 

the above limitations and problems with conventional multiprocessor resource management 
20 solutions and it is to this end that the present invention is directed. 

Summary of the Invention 

The cross bar multipath resource controller system in accordance with the invention 
overcomes the problem and limitation with conventional multiprocessor resource allocation 
solutions to provide the highest performance (e.g., the fastest access speed for each processor to 
25 the common resources) with a minimum number of resources. The cross bar multipath system in 
accordance with the invention uses a cooperative resource management system between the 



transistors for each cell since each cell may be accessed independently by the first or second 



processor. 
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multiple processors. In particular, all of the resources for both processors are connected to a 

shared cross bar bus and then each processor may negotiate for a priority access to the preferred 
locations in the resource through the use of hardware "Semaphores" in a preferred embodiment 
of the invention. 

In more detail, each processor releases the priority access privileges (in a cooperative 
manner) to any resources that are no longer needed by the processor so that the other processor 
may have priority access when needed. Consequently, when a large block of data needs to be 
transferred from one processor to the other processor, the processor that is transferring the block 
of data may release the priority access and let the other processor retrieve the data at the priority 
access rates. When a processor has the priority access to a particular memory, for example, the 
bus arbiter shorts the connection from the processor to the particular memory for normal 
operation so that no switching is required for the primary processor. When the secondary 
processor tries to access the same resource, the secondary processor would always wait for the 
idle time. Therefore, the cross bar multipath system in accordance with the invention works well 
for well orchestrated multi-processor applications especially with embedded systems with 
common operating systems. 



Thus, in accordance with the invention, a computer system having a multipath cross bar 
bus is provided wherein the computer system has one or more processors and one or more 
resources capable of being shared by the one or more processors. The system further comprises a 
resource controller and bus that is connected to each resource and to each processor wherein the 
resource controller is capable of permitting each processor to simultaneously access a different 
resource from the one or more resources. 

In accordance with another aspect of the invention, an apparatus for controlling the access 
to one or more computing resources by one or more processor is provided wherein the apparatus 
comprises a resource controller and bus that is connected to each resource and to each processor 
wherein the resource controller is capable of permitting each processor to simultaneously access 
a different resource from the one or more resources. In accordance with yet another aspect of the 
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invention an apparatus for controlling the access to one or more memory resources by one or 
more processors is provided wherein the controller comprising a memory resource controller and 
bus that is connected to each memory resource and to each processor so wherein the memory 
resource controller is capable of permitting each processor to simultaneously access a different 

5 resource from the one or more memory resources. In accordance with yet another aspect of the 
invention, an apparatus for controlling access by one or more processors to one or more 
peripheral resources is provided wherein the apparatus comprises a peripheral resource controller 
and bus that is connected to each peripheral resource and to each processor so wherein the 
resource controller is capable of permitting each processor to simultaneously access a different 

10 peripheral resource from the one or more peripheral resources. 

<■ ■» 

yo Brief Description of the Drawings 

b. ■( 

3 Figure 1 illustrates a typical multi-processor architecture; 

III I II " I 

; 

Figure 2 illustrates a cross bar multi-processor architecture in accordance with the 
s invention; 

U H 
* >fc 

"* *" \t 

f§ Figure 3 is a logical diagram illustrating the cross bar architecture of Figure 2; 

Figure 4 illustrates an example of two multi-processor tasks at different times, T 0 - T 4 , 
wherein the tasks are divided between two different processors; 

Figures 5A - 5E are diagrams illustrating the memory portions, M x - M 4 , at the different 
times, T 0 - T 4 , for the two tasks in Figure 4; 

20 Figure 6 is a block diagram illustrating a preferred embodiment of a multi-processor 

system including the cross bar multipath resource controller in accordance with the invention that 
includes a cross bar multipath memory controller and a cross bar multipath peripheral bus; 

Figure 7 is a block diagram illustrating more details of the cross bar multipath memory 
controller in accordance with the invention; 
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Figure 8 illustrates the bus access timing of the cross bar multipath memory controller of 
Figure 7; 

Figure 9 is a block diagram illustrating more details of the cross bar multipath peripheral 
bus in accordance with the invention; and 

Figure 10 illustrates the bus access timing of the cross bar multipath peripheral bus of 
Figure 9. 

Detailed Description of a Preferred Embodiment 

The invention is particularly applicable to a portable music system employing the secure 
digital music initiative (SDMI) having multiple processors and it is in this context that the 
invention will be described. It will be appreciated, however, that the cross bar multipath resource 
bus and method in accordance with the invention has greater utility, such as to any other systems 
that use multiple processors. To better understand the invention, an example of a typical multi- 
processor system will now be described so that the limitations of such a typical systems will be 
more readily apparent. 

Figure 1 illustrates a typical multi-processor architecture 30 wherein a first processor 32 
(CPU) and a second processor 34 (CPU2) must share the resources of the system including a 
static random access memory (SRAM) 36 and a math coprocessing unit (MPU) 38. In the 
typical system, the two processors must each access the SRAM using a single bus 39 at some 
point to either read data or write data. In this architecture, when the first processor is accessing 
the SRAM 36 using the bus 39, the second processor cannot access the SRAM at all and must 
wait for the completion of the memory access by the first processor. Similarly, when the second 
processor accesses the memory, the first processor must wait for access to the memory. This 
architecture therefore causes both processors to be inefficient since both cannot operate a full 
speed since both, during certain periods of time, have to wait for access to a shared resource. 
This architecture therefore effectively limits the advantages provided by the multi-processor 
architecture. Other conventional architectures such as embedded SRAMs or a dual port memory, 
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as described above, attempt to solve the above problem, but typically introduce other 

unacceptable limitations and drawbacks. The cross bar multipath resource controller in 
accordance with the invention as will now be described, overcomes the limitations and 
drawbacks with the conventional architectures and balances the need for processor speed with the 
5 desire to limit the duplication of hardware. 

Figure 2 illustrates an example of a cross bar multi-processor architecture 40 in 
accordance with the invention. In this example of the cross bar architecture 40, a two processor 
system is shown although the cross bar architecture may be used with any number of processors. 
The system may include a first processor 42 (Pj) and a second processor 44 (P 2 ) that may share 
10 the processing load for tasks being completed by the system as is well known for multi-processor 
systems. The system may further include a first bus 46 connected to the first processor 42 and a 
^ second bus 48 connected to the second processor 44. In this example, the processors are 
SI connected by the buses to a common memory 50 although the cross bar architecture may be 
,q used to connect two or more processors to a variety of different shared resources and is not 
T5 limited to the sharing of a memory. In accordance with the invention, the memory 50 (or 



whatever other resource is being shared by the processors) is divided into one or more portions 
(memory portions Mj - M 4 in this example) wherein the bus for each processor is connected to 
each portion of the shared resource as shown. Thus, the first bus 46 and the second bus 48 are 
both independently connected to each portion of the resource so that each processor may 



20 independently access each portion of the memory at different times. To control the access to 
each resource portion, the system may include a controller 52 that will be described in more 
detail below. Figure 3 is a logical diagram illustrating the cross bar architecture 40 of Figure 2 
wherein the first and second processors 42, 44 each may independently access a portion of the 
resource 50. 

25 In accordance with the invention, the first processor may access a first portion of the 

resource while the second processor may simultaneously access a second portion of the memory 
thereby keeping both processors working at full speed. If the two processors are working 
cooperatively and need to share data during a task, the first processor may access the first portion 
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of the memory during a first portion of the task and complete that portion of the task and then the 
access to the first portion of the memory may be immediately handed off to the second processor 
which may then access the data in the first memory portion. This scheme reduces the latency in 
the system since each processor may access data when needed provided that the instructions are 
generated in such a way as to encourage the sharing of the resource. Now, an example of a task 
being shared by the two processors shown in Figure 2 at different times, T 0 - T 4 will be described 
to provide a better understanding of the invention. 

Figure 4 illustrates an example of four multi-processor tasks (A, B, C and D) at different 
times, T 0 - T 4 , wherein the tasks are divided between two different processors and Figures 5 A - 
5E are diagrams illustrating the memory portions, M x - M 4 , at the different times, T 0 - T 4 , for the 
four tasks in Figure 4. In particular, the executable code for the multiprocessor system is written 
in such a way as to take advantage of the multi-processors and their processing capacity. In the 
example, four tasks (A, B, C and D) are shown. In particular, the four tasks have been split 
between the two processors wherein the first processor, P 1? performs the first part of each task 
and the second processor, P 2 , performs the second part of each task as shown. Then, while the 
second processor is completing the second part of the task, the first processor uses another 
portion of the resource to begin a next task. As shown in Figure 5 A, at time T 0 , the first and third 
memory resources (e.g., the instruction memory and the ping buffer) are currently being accessed 
by the first processor that is performing the first part of the first task (A x ). At time T 2 (as shown 
in Figure 5B), the first processor starts the first part of the second task (Bj) and accesses the 
second and fourth memory resources (M 2 and M 4 ) while the second processor starts and 
completes the second portion of the first task (A 2 ) using the first and third memory resources (M x 
and M 3 ) that were previously used by the first processor. The access is granted to the first and 
second processors by the controller shown in Figure 2. At time T 2 as shown in Figure 5C, the 
first processor has completed its part of the second task and has started the first portion of a third 
tasks (Cj) using the first and third memory resources while the second processor performs the 
second part of the second task (B 2 ) using the second and fourth memory resources. As shown in 
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Figure 5D, while the second processor is working on the second portion of the third task (C 2 ), the 
first processor requests and is granted access to the second and fourth memory resources so that 
the first processor can begin the first portion of another task (Dj). At time T 4 (as shown in Figure 
5E), the first processor has completed its tasks and is not accessing any of the memory resources 

5 while the second processor is completing the second part of the fourth task (D 2 ) using the second 
and fourth memory resources. In this manner, the tasks may be split between the processors and 
take advantage of the independent processing power of each processor. It may also take 
advantage of the two processors being able to work cooperatively as shown in Figure 4 with a 
minimum amount of latency. To accomplish the cooperative work, the tasks are apportioned 

10 between the two processors to take advantage of the dual processor system. Now, a preferred 

_ embodiment of the multi-processor system with the crossbar resource controller in accordance 

J3 with the invention will be described. 

„~ ~* t. 

-w r. 

%l Figure 6 is a block diagram illustrating a preferred embodiment of a multi-processor 

% system 60 including the cross bar multipath resource controller in accordance with the invention 

Hs that includes a cross bar multipath memory controller 62 and a cross bar multipath peripheral 

O controller 64. The details of the cross bar multipath memory controller 62 is described below 

g with reference to Figures 7 and 8 and details of the cross bar peripheral controller 64 is described 

^ > H 

LS below with reference to Figures 9 and 10. As shown, the system 60 may include a host processor 
H= 66 which may preferably be a reduced instruction set (RISC) ARM core made by ARM Inc and a 
20 coprocessor core 68 that operate in a cooperative manner to complete tasks as described above. 
In the preferred embodiment, there may also be a hardware accelerator engine 70 as shown. The 
host processor, the coprocessor and the hardware accelerator engine are all connected to the 
multipath memory controller 62 and the multipath peripheral controller 64 as shown. To control 
access to the shared resources connected to the multipath memory controller and the multipath 
25 peripheral controller, the system 60 may include a semaphore unit 72 which permits the two 

processors 66, 68 to communicate with each other and control the access to the shared resources. 
The details of the semaphore unit is described in more detail in copending US patent application 
number XX/XXX,XXX filed on XXXX,XX 2001 titled "XXX" and incorporated herein by 
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reference. The semaphore unit permits the processors to negotiate for the access to the shared 
resources as described above, but then, due to the multipath controllers 62, 64, permits the 
processors to access the resources over its own bus that is part of the controllers. To control the 
timing of the controllers 62, 64, a timer/clock 74 is connected to each controller 62, 64. 

5 Both the memory controller 62 and the peripheral controller 64 are then in turn connected 

to one or more resources that are shared by the processors. For example, the memory controller 
62 in this preferred embodiment is connected to a host instruction memory 76 that is typically 
accessed by the host processor 66, a ping buffer 78 is a general purpose memory resource that 
may be accessed by each processor, a pong buffer 79 is a general purpose memory resource that 
1 0 may be accessed by each processor and a coprocessor instruction memory 80 which is typically 
y accessed by the coprocessor 68. Due to a priority scheme and the cross bar architecture, the host 
B processor may always have priority access to its instruction memory 76 and the coprocessor may 
SJ always have priority access to its instruction memory 80 since the two processors each have 

-I 4' ti 

jS separate buses connected to each resource. The memory controller 62 may also be connected to 
H a cache memory 82, which is a well known 4-way 4 kB set associative cache in the preferred 
O embodiment, a flash memory interface 84 for connecting to an external flash memory and an 
p external synchronous dynamic random access memory (SDRAM) interface 86 with the various 
yi necessary signals, such as RAS, CAS, WE, OE and CS, to interface to a typical well known 
^ SDRAM. 

20 The peripheral multipath controller, which operates in a similar manner to the memory 

controller in that each processor may access different shared resources simultaneously, may have 
one or more peripherals connected to it. In the preferred embodiment, the peripheral controller 
may be connected to a universal serial bus (USB) interface 88 that in turn connects to a USB 
device or host, a universal asynchronous receiver/transmitter (UART) interface 90 that in turn 

25 connects to communication port (COM) hosts, a TAP/embedded ICE controller 92, an EIDE- 

CD/CF controller 94 to interface to hard disk drives or CD drives, a key matrix controller 96 that 
connects to a user input keyboard, an audio-codec controller 98 that connects to an audio 
coder/decoder (codec), an liquid crystal display (LCD) display controller 100 that connects to a 
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LCD display, a smartcard controller 1 02 for connecting to a well known smart card and an 

input/output (I/O) expansion port 104 that connects to one or more different input/output devices. 

As with the memory controller, the peripheral controller provides access for each processor to 
each shared resource. Now, more details of the cross bar multipath memory controller in 
accordance with the invention will be described. 

Figure 7 is a block diagram illustrating more details of the cross bar multipath memory 
controller 62 in accordance with the invention wherein some other elements of the system, such 
as the host processor 66, the coprocessor 68 and the semaphore unit 72 are also shown connected 
to the memory controller. In addition, the buffers and memories 76 - 82 connected to the 
memory controller 62 are shown. The memory controller may include one or more controllers to 
control the resources connected to the memory controller. In the preferred embodiment shown, 
the memory controller may include a first and second SRAM controller 1 10, 1 12 that control the 
instruction memories 76, 80 and a first and second buffer controller 1 14, 116 that control the 
ping buffer 78 and the pong buffer 79, respectively. The memory controller may also include an 
external memory controller 118 and a cache controller 120. The semaphore unit 72 permits 
communications of the resource allocations between the processors. 

To control the access to each shared resource, the memory controller may include a 
resource arbitration controller 122 that is connected to a first address controller 124 that is in turn 
connected to the first processor 66 and a second address controller 126 that is connected to the 
coprocessor 68. The address controllers and known as translate and look aside buffers which are 
well known. The resource arbitration controller may be connected to each resource controller 
1 10 - 120 to control the operation of the memory resources. To control access by each processor 
to the various memory resources, the resource arbitration controller 122 may also be connected to 
a first multiplexer (MUX) 128, a second MUX 130, a third MUX 132 and a fourth MUX 134 and 
control the operation of those MUXes as described below. In this embodiment, the first and 
second MUXes 128, 130 are also connected to the host processor 66 to provide it with data from 
the one or more resources (including memory resources and peripheral resources) over a memory 
data line, MD1 and the third and fourth MUXes 132, 134 are also connected to the coprocessor 
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68 to provide it with data from the one or more resources (including memory resources and 
peripheral resources) over a memory data line, MD2. As shown, each processor has its own 
memory data bus so that each can be accessing a different resource simultaneously. The first and 
second MUXes 128, 130 multiplex the buses from the different memory resources and the 

5 different peripheral resources over a peripheral data bus (PD1) and select one onto the single 
MD1 bus while the third and fourth MUXes 132, 134 multiplex the buses from the different 
memory resources and the different peripheral resources over a peripheral data bus (PD2) into the 
single MD2 bus. The first and third MUXes 128, 132 also multiplex peripheral data from a 
peripheral data bus (PD1 or PD2 depending on whether it is connected to the host processor or 

1 0 the coprocessor) into the memory data bus as needed. 

CJ in the embodiment shown, the first MUX 128 has a peripheral data bus PD1 input, a pong 

C9 buffer input, a ping buffer input and a bus connected to the coprocessor 68. As shown, the 

SI MUXes select one of these input, under control of the resource arbitration controller 122 as 

% shown by the "00", "01", "10" and "1 1" notations next to each input of the MUX. Similarly, the 

f$ second MUX 1 3 0 has an input which in the output of the first MUX, an input from the 

O coprocessor instruction memory 80, an input from the host processor instruction memory 76 and 

q an input from the cache 82. Thus, using the two MUXes 128, 130 in this preferred embodiment, 

■* 

seven different buses from seven different memory resources and peripheral resources may be 
directly accessed by each processor under the control of the memory controller. It should be 

20 noted, however, that each processor has a direct link to each memory resource and peripheral 
resource so that, for example, one processor may access its instruction memory while the other 
processor simultaneously accesses its own instruction memory. In accordance with the 
invention, if there are more memory resources, additional cascaded MUXes may be provided as 
is well known. Thus, for example, three MUXes may be used to select one signal from 10 

25 different resources, four MUXes may be used to select one signal from 13 different resources and 
so on. As with the first MUX 128, the second MUX 130 is also controlled by the resource 
arbitration controller 122 as shown by the logic notations next to each input of the MUX. 
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Similarly, the third and fourth MUXes 132, 134 connect the seven lines from each 
memory resource and peripheral resource to the coprocessor wherein the MUXes select one 
signal based on control signals from the resource arbitration controller 122 as described above. 
The resource arbitration controller 122 may be, for example, a microcontroller device running a 
5 piece of software code/instructions or hard coded instructions wherein it may generate the 

appropriate controls for the MUXes 128 - 134 based on data from the semaphore unit 72 and the 
address controllers 124, 126. 

In operation, based on the instructions being executed by the processors as indicated by 
the semaphore unit 72 and data being received by the address controllers 124, 126, the resource 
10 arbitration controller may grant access to a particular memory or peripheral resource to a 

particular processor and then control the one or more appropriate MUXes to connect the 
ffj requested resource to the particular processor. For example, if the host processor 66 requests 
Si access to the host processor instruction memory 76, the resource arbitration controller 122 grants 
Ji it access since the host processor has priority access as described above. The controller 122 may 
f5 then send a message to the SRAM controller 110 indicating that it is going to be accessed by the 
□ host processor. The controller 122 may then (or simultaneously) send a "01" data string to the 
p second MUX 130 to program it to pass the signal from the instruction memory bus through to the 
L2 memory data bus MD1 so that the host processor can access the instruction memory. When the 

j^gf- processor has completed its access, it may release the bus so that, for example, the 
20 coprocessor could in fact access the instruction memory for the host processor in a similar 

manner. 

The architecture shown allows priority access to shared memory resources through the 
hardware semaphores generated by the semaphore unit 72 as will now be described. Now, the 
operation of the memory controller will be described in more detail with reference to Figure 8. 

25 Figure 8 illustrates the bus access timing of the cross bar multipath memory controller of 

Figure 7. In the preferred embodiment, each processor may be assigned a priority status at 
different times based on a bit of a semaphore from the semaphore unit so that the processor that 
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has been given the priority is always able to access the particular shared resource while the non- 
priority processor must wait to access the shared resource. Provided that the instructions being 
executed by the processors is appropriately constructed, the priority scheme set by the 
semaphores may track those instructions so that the non-priority processor is less likely to need a 
5 shared resource during the time that it does not have priority access. The processor with priority 
access has "zero-wait" access in that it can access a shared resource at any time. 

The timing diagram in Figure 8 depicts the worst case condition where the host processor 
66 and the secondary processor (COP) 68 are accessing a common resource block (RAMO). 
During normal operation when the processors are not both requesting the same shared resource, 
10 each processor can simultaneously access the resource due to the multipath controllers. In the 
% preferred embodiment, depending on the state of the Arbitration Priority Semaphore bit (BO- 
S SEMA) as shown in the timing diagram, the access latency is modified for each processor. In the 
Si example shown in the diagram, the first half of the timing diagram shows when host processor 66 
5 has the priority for the SRAMO resource while the second half of the timing diagram shows when 
T5 COP 68 has the priority for accessing the SRAM1 memory block. As can be seen from the 
Q timing diagram, each processor, when it has priority, gets the "zero-wait" cycle accesses from it's 
O privileged resources while the non-priority processor gets the "one-or-more-wait" cycle accesses 

****** 

^ for it's non-priority resources. In accordance with the invention, a processor may have priority to 
^ all of the resources during a period of time and then the other processor may have priority. Also 
20 in accordance with the invention, certain resources may be priority resources for one processor at 
all times, such as the host processor instruction memory resource for the host processor, while 
the other processor may have priority access at all times to other resources, such as the 
coprocessor instruction memory for the coprocessor. Alternatively, there may be some resources 
which are always a priority resource for a particular processor and other resources which are a 
25 priority resource for different processors at different times. 

Returning to the diagram, it should be noted that the non-priori processor (e.g., the 
processor that does not currently have priority access to the particular shared resource) will wait 
indefinitely for the next free cycles when consecutive access are made from the privileged 
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processor. Fortunately, this would not be a valid situation, since the arbitration is based on the 

availability of each resource blocks (RAM) instead of the processor requests. Considering that 
each processor must fetch instruction and then request data, our Zero-Wait cycle internal Ping- 
Pong Buffers should have more then sufficient memory band-width for effective dual-processor 
applications. Note also that the arbitration is based on the target resource so that the target 
resource clock (SysCLK) instead of the processor clock is used. Care should be given that 
different processor clock rates may affect the arbitration scheme, but it should always work for 
the maximum performance for the processor with the priority access. Now, the multipath 
peripheral controller and bus in accordance with the invention will be described in more detail. 

Figure 9 is a block diagram illustrating more details of the cross bar multipath peripheral 
controller and bus 64 in accordance with the invention wherein other elements of the system are 
shown as well, such as the host processor 66, the coprocessor 68, the semaphore unit 72 and the 
one or more peripheral interfaces 88 - 104 that interface to the one or more peripherals. The 
multipath peripheral controller 64 may include one or more controllers for the different 
peripherals such as a host controller 150, a TAP controller 152, a EIDE controller 154, a matrix 
controller 156, a codec controller 158, a display controller 160, a smart card controller 162 and 
an IO expansion controller 164 as shown. These controllers are controlled by a resource 
arbitration controller 166 that control the access of the processors to these shared peripheral 
resources similar to the access control of the memory resources. 

The resource arbitration controller 166, that may be a microcontroller executing one or 
more pieces of code or microcode to send arbitration commands to the peripherals in a preferred 

embodiment, may include a first and second address controller 168, 170 that communicate with 
the address controllers of the resource arbitration controller for the multipath memory controller. 
As with the memory controller, to select a signal from a peripheral to be provided to each 
processor, the peripheral controller 64 may include a first multiplexer (MUX) 172, a second 
MUX 174, a third MUX 176 and a fourth MUX 178. Each MUX is controlled in order to select 
a signal from one or more signals and output the selected signal as is well known. In this 
embodiment, the first MUX 172 outputs a signal onto a peripheral data bus (PD1) indirectly to 
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the host processor and the third MUX 176 outputs a signal on another peripheral data bus (PD2) 
indirectly to the coprocessor. The MUXes indirectly output the signal since those signals are 
actually fed into MUX 128 and MUX 132 as shown in Figure 7 and then fed into each processor 
over the memory data buses (MD1 and MD2 as shown in Figure 7) assuming the appropriate 
5 control signals to the MUXes 128, 132 are provided. Thus, the signal for the particular 
peripheral is selected by the MUXes 172, 174 and 176, 178, but it is communicated to the 
processors over the memory data buses due to the MUXes 128, 130. 

As with the memory controller, the MUXes 172, 174, 176, 178 are controlled to select a 
bus containing a signal from a particular peripheral to route that signal to a particular processor. 
10 As with the memory controller, the multipath architecture permits each processor to 

simultaneously access a shared peripheral resources as long as both processors do not need the 
33 same shared resource. The operation of the peripheral controller and bus is similar to the 
SJ memory controller and therefore will not be described herein. However, the timing diagram for 

J5 the peripheral controller will now be briefly described. 



Figure 9. The peripheral bus architecture allows equal access for both processors to all of the 
peripheral shared resources. Therefore, unlike the memory controller that has priority access to 
certain memory resources, the host processor 66 and coprocessor 68 have equal access to each 
resource. The normal mode of arbitration is to give access to a resource to the first processor that 



20 requested access to the resource. When both processor request access to the same resource (e.g., 
for the same peripheral device) a typical round-robin type of fair access arbitration scheme may 
be utilized although other well known arbitration may also be used. Consequently, due to the 
equal access of the processors, the peripheral device accesses would always be one-or-more wait 
bus cycles in contrast to the memory resource controller. 

25 While the foregoing has been with reference to a particular embodiment of the invention, 

it will be appreciated by those skilled in the art that changes in this embodiment may be made 



Figure 10 illustrates the bus access timing of the cross bar multipath peripheral bus of 
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without departing from the principles and spirit of the invention, the scope of which is defined by 
the appended claims. 
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